Search CORE

173 research outputs found

FASTCUDA: Open Source FPGA Accelerator &amp; Hardware-Software Codesign Toolset for CUDA Kernels

Author: de la Torre E.()
Lavagno L.()
Lazarescu M.()
Mavroidis I. ()
Papaefstathiou I.()
Papaefstathiou Ioannis(http://users.isc.tuc.gr/~ipapaefstathiou)
Schafer F.()
Παπαευσταθιου Ιωαννης(http://users.isc.tuc.gr/~ipapaefstathiou)
Publication venue: IEEE / Institute of Electrical and Electronics Engineers Incorporated:445 Hoes Lane:Piscataway, NJ 08854:(800)701-4333, (732)981-0060, EMAIL: [email protected], INTERNET: http://www.ieee.org, Fax: (732)981-9667
Publication date: 01/01/2012
Field of study

Using FPGAs as hardware accelerators that communicate with a central CPU is becoming a common practice in the embedded design world but there is no standard methodology and toolset to facilitate this path yet. On the other hand, languages such as CUDA and OpenCL provide standard development environments for Graphical Processing Unit (GPU) programming. FASTCUDA is a platform that provides the necessary software toolset, hardware architecture, and design methodology to efficiently adapt the CUDA approach into a new FPGA design flow. With FASTCUDA, the CUDA kernels of a CUDA-based application are partitioned into two groups with minimal user intervention: those that are compiled and executed in parallel software, and those that are synthesized and implemented in hardware. A modern low power FPGA can provide the processing power (via numerous embedded micro-CPUs) and the logic capacity for both the software and hardware implementations of the CUDA kernels. This paper describes the system requirements and the architectural decisions behind the FASTCUDA approach

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Institutional Repository of the Technical University of Crete

Density of States of the lattice Schwinger model

Author: Bañuls M.
Cirac J.
Papaefstathiou I.
Robaina D.
Publication venue: 'American Physical Society (APS)'
Publication date: 01/07/2021
Field of study

MPG.PuRe

Sqrt{shat}_{min} resurrected

Author: A Moraes
A Papaefstathiou
A Papaefstathiou
AJ Barr
AJ Barr
B Allanach
B Allanach
BC Allanach
C Lester
D Stump
D Tovey
DR Tovey
F Abe
F Maltoni
G Aad
G Arnison
G Arnison
H Bachacou
I Antcheva
I Hinchliffe
JA Conley
ML Mangano
P Konar
P Konar
R Corke
T Sjöstrand
T Stelzer
Tania Robens
VD Barger
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2012
Field of study

We discuss the use of the variable sqrt{shat}_{min}, which has been proposed in order to measure the hard scale of a multi parton final state event using inclusive quantities only, on a SUSY data sample for a 14 TeV LHC. In its original version, where this variable was proposed on calorimeter level, the direct correlation to the hard scattering scale does not survive when effects from soft physics are taken into account. We here show that when using reconstructed objects instead of calorimeter energy and momenta as input, we manage to actually recover this correlation for the parameter point considered here. We furthermore discuss the effect of including W + jets and t tbar+jets background in our analysis and the use of sqrt{shat}_{min} for the suppression of SM induced background in new physics searches.Comment: 23 pages, 9 figures; v2: 1 figure, several subsections and references as well as new author affiliation added. Corresponds to published versio

arXiv.org e-Print Archive

Crossref

Queue Management in Network Processors

Author: A Nikologiannis
C Kachris
G Kornaros
I Mavroidis
I Papaefstathiou
T Orphanoudakis
Publication venue
Publication date: 24/04/2020
Field of study

Abstract: -One of the main bottlenecks when designing a network processing system is very often its memory subsystem. This is mainly due to the state-of-the-art network links operating at very high speeds and to the fact that in order to support advanced Quality of Service (QoS), a large number of independent queues is desirable. In this paper we analyze the performance bottlenecks of various data memory managers integrated in typical Network Processing Units (NPUs). We expose the performance limitations of software implementations utilizing the RISC processing cores typically found in most NPU architectures and we identify the requirements for hardware assisted memory management in order to achieve wire-speed operation at gigabit per second rates. Furthermore, we describe the architecture and performance of a hardware memory manager that fulfills those requirements. This memory manager, although it is implemented in a reconfigurable technology, it can provide up to 6.2Gbps of aggregate throughput, while handling 32K independent queues

CiteSeerX

Forward Jets and Energy Flow in Hadronic Collisions

Author: A. Banfi
A. Idilbi
A. Papaefstathiou
A.H. Mueller
B. Blok
B.E. Cox
B.W. Xiao
B.W. Xiao
C. Adloff
C. Ewerz
C.F. Berger
E. Maina
E. Maina
E. Maina
E.L. Berger
F. Dominguez
F. Hautmann
F. Hautmann
F. Hautmann
F. Hautmann
F. Hautmann
F. Hautmann
F. Hautmann
F. Hautmann
F. Hautmann
F. Hautmann
F.A. Ceccopieri
G. Calucci
G. Calucci
G. Calucci
G.P. Salam
H. Jung
H. Jung
I. Cherednikov
I. Cherednikov
I. Sung
I.W. Stewart
J.C. Collins
J.C. Collins
J.M. Butterworth
J.R. Gaunt
J.R. Gaunt
M. Cacciari
M. Deak
M. Diehl
M. Strikman
M.G. Ryskin
P. Skands
P. Skands
P.J. Mulders
S. Abdullin
S. Alioli
S. Catani
S. Catani
S. Catani
S. Catani
S. Catani
S. Catani
S. Chatrchyan
S. Domdey
S. Mantry
S. Mantry
S. Mert Aybat
T. Becher
T. Sjöstrand
T. Sjöstrand
T. Sjöstrand
T.C. Rogers
Y. Hatta
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/12/2011
Field of study

We observe that at the Large Hadron Collider, using forward + central detectors, it becomes possible for the first time to carry out calorimetric measurements of the transverse energy flow due to "minijets" accompanying production of two jets separated by a large rapidity interval. We present parton-shower calculations of energy flow observables in a high-energy factorized Monte Carlo framework, designed to take into account QCD logarithmic corrections both in the large rapidity interval and in the hard transverse momentum. Considering events with a forward and a central jet, we examine the energy flow in the interjet region and in the region away from the jets. We discuss the role of these observables to analyze multiple parton collision effects.Comment: 9 pages, 5 figures. Version2: added results on azimuthal distributions and more discussion of energy flow definition using jet clusterin

arXiv.org e-Print Archive

DESY Publication Database

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

DESY

Oxford University Research Archive

Institutional Repository Universiteit Antwerpen

CERN Document Server

FASTER: Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration

Author: Becker T.
Brokalakis A.
Bruneel K.
Böhm P.
Ciobanu C.
Davidson T.
Gaydadjiev G.
Heyse K.
Luk W.
Niu X.
Papadimitriou K.
Papaefstathiou I.
Pau D.
Pell O.
Pilato C.
Pnevmatikatos D
Santambrogio M.D.
Sciuto D.
Stroobandt D.
Todman T.
Vansteenkiste E.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

The FASTER (Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration) EU FP7 project, aims to ease the design and implementation of dynamically changing hardware systems. Our motivation stems from the promise reconfigurable systems hold for achieving high performance and extending product functionality and lifetime via the addition of new features that operate at hardware speed. However, designing a changing hardware system is both challenging and time-consuming. FASTER facilitates the use of reconfigurable technology by providing a complete methodology enabling designers to easily specify, analyze, implement and verify applications on platforms with general-purpose processors and acceleration modules implemented in the latest reconfigurable technology. Our tool-chain supports both coarse- and fine-grain FPGA reconfiguration, while during execution a flexible run-time system manages the reconfigurable resources. We target three applications from different domains. We explore the way each application benefits from reconfiguration, and then we asses them and the FASTER tools, in terms of performance, area consumption and accuracy of analysis

Archivio istituzionale della ricerca - Politecnico di Milano

Ghent University Academic Bibliography

Chalmers Research

Chalmers Publication Library

Vitamin-V: Virtual Environment and Tool-boxing for Trustworthy Development of RISC-V based Cloud Services

Author: A. Arelakis
A. Call
A. Rigo
A. Savino
A. Scionti
A. Torregrosa
B. Otero
D. Gizopoulos
D. Pnevmatikatos
D. Raho
E. Rodríguez
F. Lubrano
G. Papadimitriou
I. Papaefstathiou
J. Costa
J. L. Berral
J. M. Arnau
K. Nikas
N. Tampouratzis
R. Canal
S. Di Carlo
V. Karakostas
Y. Nikolakopoulos
Publication venue: RISC-V International
Publication date: 01/01/2023
Field of study

Vitamin-V is a 2023-2025 Horizon Europe project that aims to develop a complete RISC-V open-source software stack for cloud services with comparable performance to the cloud-dominant x86 counterpart and a powerful virtual execution environment for software development, validation, verification, and test that considers the relevant RISC-V ISA extensions for cloud deployment

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

RECO level \sqrt{s}_{min} and subsystem \sqrt{s}_{min}: improved global inclusive variables for measuring the new physics mass scale in missing energy events at hadron colliders

Author: A Barr
A Papaefstathiou
A Papaefstathiou
AJ Barr
AJ Barr
AJ Barr
AJ Barr
AJ Barr
B Gripaios
B Webber
BC Allanach
BC Allanach
BK Gjelsten
BK Gjelsten
C Lester
CG Lester
D Costanzo
D Krohn
DJ Miller
DR Tovey
G Polesello
H Bachacou
H-C Cheng
H-C Cheng
H-C Cheng
H-C Cheng
I Hinchliffe
I Hinchliffe
J Alwall
J Alwall
J Alwall
J Alwall
J Alwall
J Alwall
J Hubisz
JM Butterworth
K Agashe
K Hamaguchi
K Kawagoe
Konstantin T. Matchev
KT Matchev
KT Matchev
Kyoungchul Kong
M Burns
M Burns
M Burns
MM Nojiri
MM Nojiri
MM Nojiri
Myeonghun Park
N Kidonakis
P Konar
P Konar
P Konar
Partha Konar
S Chang
SD Ellis
SD Ellis
T Plehn
T Sjöstrand
U Baur
WS Cho
WS Cho
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/06/2010
Field of study

The variable \sqrt{s}_{min} was originally proposed in arXiv:0812.1042 as a model-independent, global and fully inclusive measure of the new physics mass scale in missing energy events at hadron colliders. In the original incarnation of \sqrt{s}_{min}, however, the connection to the new physics mass scale was blurred by the effects of the underlying event, most notably initial state radiation and multiple parton interactions. In this paper we advertize two improved variants of the \sqrt{s}_{min} variable, which overcome this problem. First we show that by evaluating the \sqrt{s}_{min} variable at the RECO level, in terms of the reconstructed objects in the event, the effects from the underlying event are significantly diminished and the nice correlation between the peak in the \sqrt{s}_{min}^{(reco)} distribution and the new physics mass scale is restored. Secondly, the underlying event problem can be avoided altogether when the \sqrt{s}_{min} concept is applied to a subsystem of the event which does not involve any QCD jets. We supply an analytic formula for the resulting subsystem \sqrt{s}_{min}^{(sub)} variable and show that its peak exhibits the usual correlation with the mass scale of the particles produced in the subsystem. Finally, we contrast \sqrt{s}_{min} to other popular inclusive variables such as H_T, M_{Tgen} and M_{TTgen}. We illustrate our discussion with several examples from supersymmetry, and with dilepton events from top quark pair production.Comment: 41 pages, 26 figure

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector